Path comparison unit for determining paths in a trellis that compete with a survivor path

ABSTRACT

A path comparison unit is disclosed for determining paths in a trellis that compete with a survivor path. The disclosed path comparison unit comprises a first type functional unit comprising a multiplexer and a register to store one or more survivor bits associated with the survivor path; and at least two second type functional units, wherein each second type functional unit comprises a multiplexer and a logical circuit to compute at least one equivalence bit indicating whether the bit for a respective path and the bit for the survivor path are equivalent. Generally, the respective path is one or more of a win-lose path and a lose-win path.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a continuation of U.S. patent application Ser. No. 11/045,585, filed Jan. 28, 2005 now U.S. Pat. No. 7,607,072; and is related to U.S. Pat. No. 7,487,432; each incorporated by reference herein.

FIELD OF THE INVENTION

The present invention relates generally to equalization, detection and decoding techniques using the Soft-Output Viterbi Algorithm (SOVA).

BACKGROUND OF THE INVENTION

A magnetic recording read channel converts an analog read signal into an estimate of the user data recorded on a magnetic medium. Read heads and magnetic media introduce noise and other distortions into the read signal. As the information densities in magnetic recording increase, the intersymbol interference (ISI) becomes more severe as well. In read channel chips, a Viterbi detector is typically used to detect the read data bits in the presence of intersymbol interference and noise.

The Soft-Output Viterbi Algorithm (SOVA) is a well known technique for generating soft decisions inside a Viterbi detector. A soft decision provides a detected bit with a corresponding reliability. These soft decisions can be used by an outer detector to improve the error rate performance of the overall system. For a more detailed discussion of SOVA detectors, see, for example, J. Hagenauer and P. Hoeher, “A Viterbi Algorithm with Soft-decision Outputs and its Applications,” IEEE Global Telecommunications Conference (GLOBECOM), vol. 3, 1680-1686 (November 1989). SOVA architectures exist for one-step trellises, where one soft decision is generated per clock cycle. SOVA detectors may be implemented, for example, in next-generation read channel systems, and data rates in excess of 2 Gigabits-per-second will have to be achieved. It is challenging to achieve such high data rates with existing SOVA architectures that consider one-step trellises.

A need therefore exists for a method and apparatus for performing SOVA detection at the high data rates that are required, for example, by evolving high-end storage applications. A further need exists for a method and apparatus for performing SOVA detection employing a multiple-step trellis.

SUMMARY OF THE INVENTION

Generally, a path comparison unit is disclosed for determining paths in a trellis that compete with a survivor path. The disclosed path comparison unit comprises a first type functional unit comprising a multiplexer and a register to store one or more survivor bits associated with the survivor path; and at least two second type functional units, wherein each second type functional unit comprises a multiplexer and a logical circuit to compute at least one equivalence bit indicating whether the bit for a respective path and the bit for the survivor path are equivalent. Generally, the respective path is one or more of a win-lose path and a lose-win path. A plurality of the first type functional units can be connected in a register exchange architecture.

A functional element comprises the first type functional unit and the two second type functional units. In one embodiment, the first type functional unit and the two second type functional units within a functional element operate in parallel. The logical circuit, such as an exclusive OR (XOR) function, can compare an output of two multiplexers in the same functional element. A plurality of the functional elements are connected according to the trellis.

A plurality of the functional elements can be configured in an array structure having a plurality of rows and a plurality of columns. In one embodiment, an output of one of the first type functional units in a first column is applied as an input to at least one functional element in a subsequent column. A row in the array structure can correspond to a given state and a column in the array structure can correspond to a one-step-trellis period.

Inputs to a given multiplexer comprise at least one survivor bit from at least one functional element of a previous column. The multiplexers in the first type functional units in a given row of the array structure are typically controlled by the same selection signal. In one embodiment, at least two second type functional units in a given row of the array structure comprise first and second multiplexers operating in parallel and wherein the first multiplexers in the given row are controlled by a first selection signal and the second multiplexers in the given row are controlled by a second selection signal. In one implementation, a first path comparison unit is provided for bits corresponding to even one-step-trellis periods and a second path comparison unit is provided for bits corresponding to odd one-step-trellis periods.

According to another aspect of the invention, a circuit is disclosed that comprises a first type functional unit comprising a multiplexer and a register; and at least two second type functional units, wherein each second type functional unit comprises a multiplexer and a logical circuit, wherein (i) the first type functional unit and the at least two second type functional units operate in parallel, (ii) the multiplexers in the first type functional unit and the at least two second type functional units each process a same set of input signals, (iii) a first logical circuit in a first of the at least two second type functional units processes an output from a first multiplexer in the first of the at least two second type functional units and an output from the multiplexer in the first type functional unit; and (iv) a second logical circuit in a second of the at least two second type functional units processes an output from a second multiplexer in the second of the at least two second type functional units and an output from the multiplexer in the first type functional unit.
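By way of illustration only, the behavior of one functional element may be sketched in Python as follows. The function name functional_element, the argument names, and the polarity of the comparison output (1 when the bits differ, as produced by an XOR) are assumptions made for this sketch and are not taken from the figures.

```python
def functional_element(inputs, sel_survivor, sel_a, sel_b):
    """Model one functional element for a single clock cycle.

    inputs       -- bits offered by the previous column; all three
                    multiplexers see this same set of input signals
    sel_survivor -- input chosen by the multiplexer of the first type unit
    sel_a, sel_b -- inputs chosen by the multiplexers of the two
                    second type units (hypothetical names)
    """
    # First type unit: multiplexer output, to be latched in the register.
    survivor_bit = inputs[sel_survivor]
    # Second type units: multiplexer followed by an XOR against the
    # survivor multiplexer output (assumed polarity: 1 means "differs").
    differs_a = inputs[sel_a] ^ survivor_bit
    differs_b = inputs[sel_b] ^ survivor_bit
    return survivor_bit, differs_a, differs_b


if __name__ == "__main__":
    print(functional_element([0, 1, 1, 0], 0, 1, 2))
```

The three selections are independent of one another, which reflects the statement that the units within a functional element can operate in parallel.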

A more complete understanding of the present invention, as well as further features and advantages of the present invention, will be obtained by reference to the following detailed description and drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a one-step trellis diagram for a channel with memory L=2;

FIG. 2 illustrates the two-step SOVA for the one-step trellis shown in FIG. 1;

FIG. 3 is a schematic block diagram for a SOVA implementation employing a one-step trellis;

FIG. 4 illustrates a one-step trellis for a channel with memory L=3;

FIG. 5 illustrates a two-step trellis for a channel with memory L=3;

FIG. 6 is a schematic block diagram showing a SOVA implementation for a two-step trellis;

FIG. 7 illustrates a detailed schematic block diagram of a SOVA implementation for a two-step trellis;

FIG. 8 illustrates the path metric differences computed by a SOVA detector for a two-step trellis;

FIG. 9 is a schematic block diagram showing an exemplary implementation of the ACS operation of FIG. 7 and the generation of path metric differences Δ⁻¹ and Δ₀;

FIG. 10 is a schematic block diagram showing an alternate implementation of the ACS operation of FIG. 7 and the generation of the path metric differences Δ⁻¹ and Δ₀;

FIG. 11 is a schematic block diagram showing an exemplary implementation of the survivor memory unit of FIG. 7;

FIG. 12 is a schematic block diagram showing an exemplary implementation of the path comparison of FIG. 7 for bits corresponding to even one-step-trellis periods;

FIG. 13 is a schematic block diagram showing an exemplary implementation of the path comparison of FIG. 7 for bits corresponding to odd one-step-trellis periods; and

FIG. 14 is a schematic block diagram showing an exemplary implementation of the reliability update of FIG. 7 for the maximum-likelihood (ML) path.

DETAILED DESCRIPTION

The present invention recognizes that the limitation on achievable data rates in a SOVA detector is overcome by employing a multiple-step trellis. The multiple-step trellis is obtained from a one-step trellis by collapsing transitions over multiple time steps into one. In other words, each transition in the multiple-step trellis corresponds to multiple transitions in the one-step trellis. For example, in an exemplary two-step trellis, each transition in the two-step trellis corresponds to two transitions in the original one-step trellis. SOVA detectors in accordance with the present invention can operate at data rates that are about twice the data rates of conventional designs that use one-step trellises. Even larger speed-ups are achievable for multiple-step trellises with step sizes larger than two.

One-Step SOVA

The present invention is illustrated in the context of a two-step SOVA, where Viterbi detection is followed by reliability processing. For a discussion of suitable two-step SOVA architectures for one-step trellises, see, for example, O. J. Joeressen and H. Meyr, “A 40-Mb/s Soft-Output Viterbi Decoder,” IEEE J. Solid-State Circuits, vol. 30, 812-818 (July, 1995), and E. Yeo et al., “A 500-Mb/s Soft-Output Viterbi Decoder,” IEEE Journal of Solid-State Circuits, vol. 38, 1234-1241 (July, 2003). The present invention applies, however, to any SOVA implementation, as would be apparent to a person of ordinary skill in the art. For a discussion of suitable one-step SOVAs, see, for example, J. Hagenauer and P. Hoeher, “A Viterbi algorithm with Soft-Decision Outputs and its Applications,” IEEE Global Telecommunications Conference (GLOBECOM), vol. 3, 1680-1686 (November, 1989), and O. J. Joeressen et al., “High-Speed VLSI Architectures for Soft-Output Viterbi Decoding,” Journal of VLSI Signal Processing, vol. 8, 169-181 (1994), incorporated by reference herein. It is important to distinguish the terms “one-step SOVA” and “two-step SOVA” from the term “multiple-step trellis.” While the term “n-step SOVA” indicates the number of steps, n, required to perform Viterbi and reliability processing, the term “multiple-step trellis” indicates a trellis obtained from a one-step trellis by collapsing transitions over multiple time steps into one.

Two-Step SOVA for a One-Step Trellis

FIG. 1 shows a one-step trellis 100, where a state is defined by the two most recent state bits b₀b⁻¹ and denoted as state(b₀b⁻¹). This trellis corresponds, e.g., to an ISI channel with memory L=2. The bit b₀ is associated with the transition state(b⁻¹b⁻²)→state(b₀b⁻¹).

FIG. 2 illustrates the two-step SOVA for an expanded version 200 of the trellis 100 shown in FIG. 1. The two-step SOVA is explained, e.g., in O. J. Joeressen and H. Meyr, “A 40 Mb/s Soft-Output Viterbi Decoder,” IEEE Journal of Solid-State Circuits, Vol. 30, 812-18 (July, 1995). The first step of the two-step SOVA determines the maximum likelihood (ML) path 210 in FIG. 2, in a similar manner to the conventional Viterbi algorithm. FIG. 2 illustrates the steady-state of the Viterbi algorithm at time step n=3, after the four survivor paths into all four states {state(b₃b₂)} have been determined. The starting state 250 {state(b₀b⁻¹)} of the ML path 210 can be identified by a D-step trace-back from the {state(b_(D)b_(D−1))} with the minimum path metric, where D is the path memory depth of the survivor memory unit. In the example of FIG. 2, it is assumed that D=3.

In the second step of the two-step SOVA, the reliabilities for the bit decisions along the ML path 210 terminating in the starting state state(b₀b⁻¹) are updated. The reliability update depth is denoted by U.

Let b′₀, b′⁻¹, . . . denote the state bits for the ML path 210 that terminates in the starting state state(b′₀, b′⁻¹). Also, let {tilde over (b)}₀, {tilde over (b)}⁻¹, . . . denote the state bits for the competing, losing path 230 in FIG. 2 that terminates in the starting state, state({tilde over (b)}₀,{tilde over (b)}⁻¹)=state(b′₀, b′⁻¹).

The absolute path metric difference between the ML path 210 and competing path 230 into the starting state, state(b′₀, b′⁻¹), is denoted by Δ′₀. The U intermediate reliabilities for the bits b′₀, b′⁻¹, . . . , b′_(−U+1) that are updated using Δ′₀ are denoted by R′_(0,0), R′_(−1,0), . . . , R′_(−U+1,0), respectively. The reliabilities are updated according to following rule:

Initialization: $R'_{0,-1} = +\infty$.

For $i = 0, -1, \ldots, -U+1$:

$R'_{i,0} = \begin{cases} \min\left(R'_{i,-1}, \Delta'_{0}\right) & \text{if } b'_{i} \neq \tilde{b}_{i,0}, \\ R'_{i,-1} & \text{otherwise,} \end{cases}$

$R'_{-U+1} = R'_{-U+1,0},$

where R′_(−1,−1), R′_(−2,−1), . . . R′_(−U+1,−1) are the intermediate reliabilities that were updated in the previous clock cycle using the path metric difference Δ′⁻¹ for the starting state state(b′⁻¹, b′⁻²), and R′_(−U+1) is the final reliability for bit b′_(−U+1).

It can be seen from the updating formula that the reliability for bit b′₀ is first initialized to infinity (R′_(0,−1)=+∞). Then, as the starting state 250 for the ML path 210 moves from state(b′₀, b′⁻¹) to state(b′_(U−1), b′_(U−2)), and as corresponding absolute path metric differences Δ′₀ to Δ′_(U−1) become available, the reliability for bit b′₀ is updated U times by using either the previous reliability, if the bit b′₀ agrees with the bit of the respective competing path, or using the minimum of the path metric difference and previous reliability.
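By way of illustration only, one application of the updating rule may be sketched in Python as follows. The function name update_reliabilities and the list layout (index 0 holds the entry for i=0, the last index holds the entry for i=−U+1) are assumptions made for this sketch.

```python
import math


def update_reliabilities(prev, ml_bits, comp_bits, delta):
    """Apply one reliability update for the current starting state.

    prev      -- R'_{i,-1} for i = -1, ..., -U+1 (length U-1)
    ml_bits   -- ML path bits b'_i for i = 0, -1, ..., -U+1 (length U)
    comp_bits -- competing path bits for the same positions (length U)
    delta     -- absolute path metric difference for the starting state
    Returns the updated intermediate reliabilities R'_{i,0} and the
    final reliability R'_{-U+1}.
    """
    # The newest bit enters the window with reliability +infinity.
    window = [math.inf] + list(prev)
    updated = [
        min(r, delta) if b != c else r
        for r, b, c in zip(window, ml_bits, comp_bits)
    ]
    return updated, updated[-1]


if __name__ == "__main__":
    # U = 3; the two paths differ only in the oldest bit of the window.
    print(update_reliabilities([2.5, 4.0], [0, 0, 0], [0, 0, 1], 1.5))
```

In the next cycle, all but the last entry of the returned list would serve as the prev argument, since the oldest entry leaves the window as the final reliability.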

The updating of reliabilities is shown in FIG. 2 for U=3, where the ML path 210 and competing path 230 merge into the starting state state(b′₀, b′⁻¹)=state(00), and the intermediate reliabilities R′_(0,0), R′_(−1,0), and R′_(−2,0) are updated based on the path metric difference Δ′₀ and the respective intermediate reliabilities from the previous updating procedure, i.e. R′_(−1,−1) and R′_(−2,−1). In the example of FIG. 2, only R′_(−2,0) is updated by taking the minimum of R′_(−2,−1) and Δ′₀, as the bits b′⁻² and {tilde over (b)}_(−2,0) differ from each other.

SOVA Architecture for a One-Step Trellis

FIG. 3 is a schematic block diagram showing a SOVA detector for a one-step trellis 300 (referred to in the following as a one-step-trellis SOVA detector). As shown in FIG. 3, a one-step-trellis SOVA detector 300 processes a received signal to generate soft decisions, in a well known manner. Each soft decision includes the detected bit and a corresponding reliability value. The SOVA detector 300 generates soft decisions at the same rate, f_(s), at which the input signals are received, f_(R). For a more detailed discussion of the SOVA, see, for example, J. Hagenauer and P. Hoeher, “A Viterbi Algorithm with Soft-Decision Outputs and its Applications,” IEEE Global Telecommunications Conference (GLOBECOM), vol. 3, 1680-1686 (November, 1989).

SOVA Detection at Higher Data Rates

FIG. 4 illustrates a one-step trellis 400 for an ISI channel having a memory L=3. There are eight channel states, and two branches corresponding to the bits b_(n)=0 and b_(n)=1 leave each state, state(b⁻¹b⁻²b⁻³), to reach a respective successor state, state(b₀b⁻¹b⁻²).

As previously indicated, the present invention increases the maximum data rate that may be achieved by a SOVA detector by transforming the original one-step trellis 400 into a multiple-step trellis 500, shown in FIG. 5. FIG. 5 illustrates an exemplary two-step trellis 500 for an ISI channel having a memory L=3, corresponding to the one-step trellis 400 of FIG. 4, in accordance with the present invention. The trellises in both FIGS. 4 and 5 are for the illustrative case that the channel memory is equal to L=3. While the present invention is described using the exemplary two-step trellis 500 of FIG. 5, the invention generalizes to cases where more than two steps are processed at once in a multiple-step trellis, as would be apparent to a person of ordinary skill in the art. As shown in FIG. 5, when one step is processed in the two-step trellis 500, two steps from the original one-step trellis 400 are processed at once. In this manner, if a two-step trellis is used, the maximum data rate that can be achieved in a hardware implementation is effectively increased by a factor of about two compared to a one-step-trellis implementation. A higher data rate increase can be achieved if more than two steps from the original one-step trellis are processed at once in the multiple-step trellis.
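By way of illustration only, the collapsing of two one-step transitions into one two-step transition may be sketched in Python as follows for the exemplary memory L=3. The function names and the tuple representation of a state (most recent bit first) are assumptions made for this sketch.

```python
from itertools import product

L = 3  # channel memory of the exemplary trellises


def one_step(state, bit):
    """One-step transition: shift the new bit in, drop the oldest bit."""
    return (bit,) + state[:-1]


def two_step_trellis():
    """Map each end state to its (start state, two-bit extension) pairs."""
    trellis = {}
    for start in product((0, 1), repeat=L):
        for first, second in product((0, 1), repeat=2):
            end = one_step(one_step(start, first), second)
            trellis.setdefault(end, []).append((start, (first, second)))
    return trellis


if __name__ == "__main__":
    for start, bits in two_step_trellis()[(0, 0, 0)]:
        print(start, "->", (0, 0, 0), "via bits", bits)
```

Under these assumptions each of the eight states has four incoming two-step transitions, in place of the two incoming branches of the one-step trellis.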

SOVA Architecture for a Two-Step Trellis

FIG. 6 is a schematic block diagram showing a SOVA implementation for a two-step trellis 600 (also referred to in the following as a two-step-trellis SOVA detector) incorporating features of the present invention. As shown in FIG. 6, the serial received signal is converted to a parallel signal at stage 610 and the parallel signals are processed by the two-step-trellis SOVA detector 600, for example, using the exemplary implementation discussed below in conjunction with FIG. 7. The two-step-trellis SOVA detector 600 generates the detected bits and reliabilities at half the rate, f_(s)=½·f_(R), at which the input signals are received, f_(R). Thus, two soft decisions are generated per clock cycle. The parallel output of the two-step trellis SOVA detector 600 may be converted to a serial signal at stage 650.

FIG. 7 illustrates a schematic block diagram of an exemplary two-step SOVA architecture 700 for a two-step trellis incorporating features of the present invention. As shown in FIG. 7, the exemplary SOVA architecture 700 for a two-step trellis comprises a branch metric unit (BMU) 710.

The BMU 710 is explained for the two-step trellis shown in FIG. 5 without loss of generality. The BMU 710 computes one-step-trellis branch metrics, m(0000), m(0001), . . . , m(1111), as follows: m(b₀b⁻¹b⁻²b⁻³)=[y−e(b₀b⁻¹b⁻²b⁻³)]², where the subtracted term e(b₀b⁻¹b⁻²b⁻³) is the ideal (noise-less) channel output under the condition that the state bit block (on which the ideal output depends) is b₀b⁻¹b⁻²b⁻³.

In each two-step-trellis clock cycle, each one-step-trellis branch metric is used as a summand in two distinct two-step-trellis branch metrics. The two-step-trellis branch metric for the 5 state bits b₀b⁻¹b⁻²b⁻³b⁻⁴, where b₀ is the most recent bit at the later one-step-trellis period of the two-step-trellis cycle, is given by: m_(branch)(b₀b⁻¹b⁻²b⁻³b⁻⁴)=m(b⁻¹b⁻²b⁻³b⁻⁴)+m(b₀b⁻¹b⁻²b⁻³).
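By way of illustration only, the branch metric computation may be sketched in Python as follows. The channel taps in TAPS, the linear form of the ideal output, and the use of one received sample per one-step-trellis period (y_first for the earlier period, y_second for the later period) are assumptions made for this sketch; the text above does not specify them.

```python
from itertools import product

# Hypothetical channel taps, chosen only to make the sketch concrete.
TAPS = (1.0, 0.5, -0.25, 0.125)


def ideal_output(bits):
    """Assumed noise-less channel output e(.) for a 4-bit block."""
    return sum(h * b for h, b in zip(TAPS, bits))


def one_step_metrics(y):
    """m(bits) = [y - e(bits)]^2 for all sixteen 4-bit blocks."""
    return {
        bits: (y - ideal_output(bits)) ** 2
        for bits in product((0, 1), repeat=4)
    }


def two_step_metrics(y_first, y_second):
    """m_branch for all 5-bit blocks b = (b0, b-1, b-2, b-3, b-4)."""
    m_first = one_step_metrics(y_first)    # earlier one-step period
    m_second = one_step_metrics(y_second)  # later one-step period
    return {
        b: m_first[b[1:]] + m_second[b[:4]]
        for b in product((0, 1), repeat=5)
    }


if __name__ == "__main__":
    print(two_step_metrics(1.0, 0.0)[(0, 1, 0, 0, 0)])
```

Each entry of m_first is reused for both values of b₀, and each entry of m_second is reused for both values of b⁻⁴, which corresponds to every one-step-trellis branch metric serving as a summand in two distinct two-step-trellis branch metrics.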

In addition, the exemplary two-step-trellis SOVA architecture 700 comprises an add-compare-select unit (ACSU) 900, discussed below in conjunction with FIGS. 9 and 10, a survivor memory unit (SMU) 1100, discussed below in conjunction with FIG. 11, a path comparison unit 1200, discussed below in conjunction with FIGS. 12 and 13, a reliability unit 1400, discussed below in conjunction with FIG. 14, and a number of delay operators D1-D3.

The BMU 710, ACSU 900, and SMU 1100 implement the first step of the two-step SOVA, i.e., maximum-likelihood sequence detection using the Viterbi algorithm. The second step of the two-step SOVA is implemented by the path comparison unit 1200, which computes the paths that compete with a respective win-win path, and the reliability update unit 1400, which updates the reliabilities for the ML path.

Path Metric Difference and ACS Decision Definitions

A conventional one-step-trellis SOVA implementation computes one absolute path metric difference per state at each (one-step-trellis) clock cycle, as described, e.g., in O. J. Joeressen and H. Meyr, “A 40 Mb/s Soft-Output Viterbi Decoder,” IEEE Journal of Solid-State Circuits, Vol. 30, 812-18 (July, 1995). The present invention recognizes that in the exemplary implementation for a two-step trellis, where two steps from the original one-step trellis 400 are processed at once, two path metric differences are computed per state at each (two-step-trellis) clock cycle. Thus, as discussed below in conjunction with FIG. 9 and FIG. 10, the ACSU 900 generates, for each state, two path metric differences Δ⁻¹ and Δ₀ for the first and second period of the (two-step-trellis) clock cycle, respectively.

FIG. 8 illustrates the computation of the path metric differences Δ⁻¹ and Δ₀ in a two-step-trellis SOVA detector 600 for the exemplary one-step and two-step trellises 400 and 500, where n is the one-step-trellis time index and m is the two-step-trellis time index. In a two-step-trellis SOVA implementation, each two-step-trellis cycle contains two one-step-trellis periods. For example, as shown in FIG. 8, the cycle associated with the two-step-trellis index m=0 contains the two one-step-trellis periods associated with the one-step-trellis indices n=0 and n=−1. FIG. 8 shows four competing paths 810, 820, 830, 840. Each path 810, 820, 830, 840 can be identified with a respective two-bit selection signal indicating whether the path wins or loses in each one-step-trellis period of the two-step-trellis cycle on its way into the state defined by the 3-bit block b₀b⁻¹b⁻²=000. For example, the win-lose path 810 wins (relative to the lose-lose path) in the first period (n=−1) and loses (relative to the win-win path) in the second period (n=0) of the two-step-trellis cycle.

FIG. 8 shows the four competing paths 810, 820, 830 and 840 that terminate in the state defined by the 3-bit block b₀b⁻¹b⁻²=000.

The path metric difference Δ₀ for the second period of the two-step-trellis cycle, into the state associated with the one-step-trellis index n=0, is the difference between the win-win path segment 820-0 and the win-lose path segment 810-0. The path metric difference Δ⁻¹ for the first period of the two-step-trellis cycle, into the respective state associated with the one-step-trellis index n=−1, is the difference between the win-win path segment 820-1 and the lose-win path segment 830-1.

In a conventional one-step-trellis SOVA implementation, the ACS generates a single ACS decision, e, indicating, for each state, which branch to trace back along the winning path through the trellis. According to an exemplary convention, a value of e=0 provides an indication to trace back the upper branch from a state. The present invention recognizes that in a two-step-trellis SOVA implementation, the ACSU 900 needs to generate, for each two-step-trellis cycle, two-bit ACS decisions ef, indicating which branches to trace back along the win-win path through the trellis, where e corresponds to the first period and f to the second period of the two-step-trellis cycle. Thus, a two-bit ACS decision of ef=00 provides an indication to trace back the upper branches from the state defined by the 3-bit block b₀b⁻¹b⁻²=000 through the trellis 800 along the win-win path 820 to the state defined by the 3-bit block b⁻²b⁻³b⁻⁴=000.

Again, the path metric difference Δ₀ for the second period of the two-step-trellis cycle is the difference between the win-win path segment 820-0 and the win-lose path segment 810-0. Similarly, the path metric difference Δ⁻¹ for the first period of the two-step-trellis cycle is the difference between the win-win path segment 820-1 and the lose-win path segment 830-1. Thus, to compute the path metric differences, Δ₀ and Δ⁻¹, three different paths need to be distinguished (win-win path 820, win-lose path 810, and lose-win path 830). The two-bit ACS decision ef, however, only allows two of these paths to be distinguished. The win-win path 820 can be identified using the two-bit ACS decision ef=00. The lose-win path 830 can be identified using the two-bit selection signal ef̄=01, which can be derived from the ACS decision by using e and inverting f (f̄ denotes the inversion of f). While the second win-lose path segment 810-0 can be identified in terms of the ACS decision e, i.e., by ē=1, the first win-lose path segment 810-1 cannot be identified in terms of the ACS decision f. Thus, in order to sufficiently define the win-lose path 810 through the two-step trellis, an additional selection signal F is generated, as discussed further below.

The best path, i.e., the win-win path 820 into state(b₀b⁻¹b⁻²) is given by the bit sequence b₀b⁻¹b⁻²b⁻³b⁻⁴=b₀b⁻¹b⁻²ef=00000.

The lose-win-path 830 is thus the path that lost to the win-win path 820 in the first period of the two-step-trellis cycle and then became part of the win-win path 820. This path 830 is given by the bit sequence b₀b⁻¹b⁻²b⁻³b⁻⁴=b₀b⁻¹b⁻²ef̄=00001, and it can be traced back from state(b₀b⁻¹b⁻²) to state(b⁻¹b⁻²e), and then from state(b⁻¹b⁻²e) to state(b⁻²ef̄) using the ACS decision e and the inverted ACS decision f̄. The path metric difference Δ⁻¹ is defined as the path metric difference between the win-win path segment 820-1 and the lose-win path segment 830-1.

The win-lose-path 810 is the winning path into state(b⁻¹b⁻²ē) and the losing path into state(b₀b⁻¹b⁻²). Denote the one-step-trellis ACS decision for the two paths into state(b⁻¹b⁻²ē) by F. Then, the win-lose-path 810 can be traced back from state(b₀b⁻¹b⁻²) to state(b⁻¹b⁻²ē) and then to state(b⁻²ēF). In the example of FIG. 8, the win-lose path 810 is given by the state sequence b₀b⁻¹b⁻²b⁻³b⁻⁴=b₀b⁻¹b⁻²ēF=00010. The path metric difference Δ₀ is defined as the path metric difference between the win-win path segment 820-0 and win-lose path segment 810-0.

The lose-lose-path 840 can be traced back from state(b₀b⁻¹b⁻²) to state(b⁻¹b⁻²ē) and state(b⁻²ēF̄), but it is not of importance for the computation of the path metric differences Δ⁻¹ and Δ₀.

In summary, for each state(b₀b⁻¹b⁻²) two path metric differences Δ⁻¹ and Δ₀ are computed, the former for the first period and the latter for the second period of a two-step-trellis cycle. The lose-win path 830 can be traced back from state(b₀b⁻¹b⁻²) to state(b⁻²ef̄) using the two-bit selection signal ef̄, and the win-lose path 810 can be traced from state(b₀b⁻¹b⁻²) to state(b⁻²ēF) using the two-bit selection signal ēF.
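By way of illustration only, the derivation of the selection signals from the ACS decisions e, f and F may be sketched in Python as follows. The function name competing_paths and the dictionary keys are assumptions made for this sketch; the entry for the lose-lose path (inverted e followed by inverted F) is likewise an assumed reading, included only for completeness.

```python
def competing_paths(state, e, f, F):
    """Return the 5-bit sequences (b0, b-1, b-2, b-3, b-4) of the paths
    into `state`, given the two-bit ACS decision ef and the additional
    selection signal F."""
    e_bar, f_bar, F_bar = 1 - e, 1 - f, 1 - F
    return {
        "win-win": state + (e, f),
        "lose-win": state + (e, f_bar),      # e with inverted f
        "win-lose": state + (e_bar, F),      # inverted e with F
        "lose-lose": state + (e_bar, F_bar), # assumed reading
    }


if __name__ == "__main__":
    for name, bits in competing_paths((0, 0, 0), 0, 0, 0).items():
        print(name, "".join(map(str, bits)))
```

For state(000) with ef=00 and F=0, the sketch yields the sequences 00000, 00001 and 00010 for the win-win, lose-win and win-lose paths, in agreement with the example of FIG. 8.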

Returning to FIG. 7, the path metric differences Δ₀ and Δ⁻¹, and the ACS decisions e, f and F are delayed in the delay buffers D2 for a time that is equal to the delay of the path memory and the delay buffer D1. The path comparison unit 1200 generates, for each state and bit within the reliability update window, an equivalence bit that indicates whether the win-win path and a respective competing path agree in terms of the bit decision. The path metric differences and equivalence bits that correspond to the starting state of the ML path are selected based on a selection signal that is defined by the state bits in the delay buffer D1. The state bits for the ML path at the output of the SMU are first stored in the delay buffer D1 and then in the delay buffer D3.

ACSU

FIG. 9 is a schematic block diagram showing an exemplary implementation of the ACSU 900 of FIG. 7 and the generation of path metric differences Δ⁻¹ and Δ₀ and the additional ACS decision F. The exemplary ACSU 900 considers an 8-state two-step trellis with 4 transitions per state, such as the trellis 500 shown in FIG. 5, in which each state is defined by the past 3 state bits b₀b⁻¹b⁻². Each two-step-trellis branch metric m_(branch)(b₀b⁻¹b⁻²b⁻³b⁻⁴) depends on the 3 state bits b⁻²b⁻³b⁻⁴ that define the starting state of a transition in the two-step trellis 800, and also on the 2 state bits b₀b⁻¹ that correspond to the path extension. The path metric for the above path extension is computed by: m′_(path)(b₀b⁻¹b⁻²b⁻³b⁻⁴)=m_(path)(b⁻²b⁻³b⁻⁴)+m_(branch)(b₀b⁻¹b⁻²b⁻³b⁻⁴), where m_(path)(b⁻²b⁻³b⁻⁴) is the path metric for the winning path into state(b⁻²b⁻³b⁻⁴) at the previous two-step-trellis cycle.

For each state, the ACSU performs the ACS operation to determine the winning path using a set of adders 910, a comparator 920 and a selector 930. For example, for state(000), the four path metrics for the path extensions into this state are computed as:

m′_(path)(00000)=m_(path)(000)+m_(branch)(00000)
m′_(path)(00010)=m_(path)(010)+m_(branch)(00010)
m′_(path)(00001)=m_(path)(001)+m_(branch)(00001)
m′_(path)(00011)=m_(path)(011)+m_(branch)(00011)

The path metric for the winning path 820 into state(b₀b⁻¹b⁻²) is determined with a 4-way comparison 920 among the path metrics for the 4 path extensions into this state, i.e., it is the minimum of the 4 values m′_(path)(b₀b⁻¹b⁻²00), m′_(path)(b₀b⁻¹b⁻²10), m′_(path)(b₀b⁻¹b⁻²01), and m′_(path)(b₀b⁻¹b⁻²11).
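By way of illustration only, one two-step-trellis ACS cycle over all eight states may be sketched in Python as follows. The function name acs_step and the dictionary layout of the path metrics and branch metrics are assumptions made for this sketch; ties are resolved arbitrarily by the order of evaluation.

```python
from itertools import product


def acs_step(path_metrics, branch_metrics):
    """Perform one two-step-trellis ACS cycle.

    path_metrics   -- maps a state (b-2, b-3, b-4) to its path metric
    branch_metrics -- maps (b0, b-1, b-2, b-3, b-4) to m_branch
    Returns the new path metrics and the two-bit decisions ef per state.
    """
    new_metrics, decisions = {}, {}
    for state in product((0, 1), repeat=3):       # (b0, b-1, b-2)
        candidates = {}
        for e, f in product((0, 1), repeat=2):    # (b-3, b-4)
            start = (state[2], e, f)
            candidates[(e, f)] = (
                path_metrics[start] + branch_metrics[state + (e, f)]
            )
        best = min(candidates, key=candidates.get)  # 4-way comparison
        decisions[state] = best
        new_metrics[state] = candidates[best]
    return new_metrics, decisions


if __name__ == "__main__":
    states = list(product((0, 1), repeat=3))
    pm = {s: float(4 * s[0] + 2 * s[1] + s[2]) for s in states}
    bm = {b: 0.0 for b in product((0, 1), repeat=5)}
    print(acs_step(pm, bm)[0][(0, 0, 0)])
```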

In the ACSU 900, the path metric differences Δ⁻¹ and Δ₀ are computed after the two-step-trellis ACS operation, as shown in FIG. 9. The two-bit, two-step-trellis ACS decision ef generated by the comparator 920 is used to select the path metric for the winning path (also referred to as the win-win path 820) at the selector 930 as in a conventional two-step-trellis ACSU. The path metric 940 of the lose-win path 830 is chosen by a selector 950 using the 2-bit selection signal ef̄. The path metric difference Δ⁻¹ is computed by taking the absolute value of the difference between the path metric of the win-win path 820 and lose-win path 830, as computed by a subtractor 955.

The win-lose path 810 and lose-lose path 840 are chosen using two 2-to-1 multiplexers 960, 965, based on the selection signal ē. This is equivalent to selecting the win-lose path 810 and the lose-lose path 840 using two 4-to-1 multiplexers that are driven by the 2-bit selection signals ē0 and ē1, respectively. The two selected path metrics are compared by a comparator 970 to identify the path metric 975 of the win-lose path 810, and the corresponding ACS decision F is generated. The path metric 975 is selected by the selector 972. The path metric difference Δ₀ is computed by a subtractor 980 that computes the absolute value of the difference between the path metric of the win-win path 820 and win-lose path 810.
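By way of illustration only, the per-state generation of the decisions and path metric differences described for FIG. 9 may be sketched in Python as follows. The function name acs_with_differences, the keying of the four extended path metrics by the bit pair (b⁻³, b⁻⁴), and the returned dictionary keys are assumptions made for this sketch; ties are resolved arbitrarily.

```python
def acs_with_differences(ext):
    """Derive ef, F and both path metric differences for one state.

    ext -- maps (b-3, b-4) to the extended path metric m'_path of the
           corresponding path extension into the state
    """
    e, f = min(ext, key=ext.get)       # 4-way comparison: decision ef
    win_win = ext[(e, f)]
    lose_win = ext[(e, 1 - f)]         # selected by e and inverted f
    e_bar = 1 - e
    # Compare the two candidates selected by inverted e; the smaller
    # metric identifies the win-lose path and defines F.
    F = min((0, 1), key=lambda bit: ext[(e_bar, bit)])
    win_lose = ext[(e_bar, F)]
    return {
        "ef": (e, f),
        "F": F,
        "path_metric": win_win,
        "delta_minus_1": abs(win_win - lose_win),  # first period
        "delta_0": abs(win_win - win_lose),        # second period
    }


if __name__ == "__main__":
    print(acs_with_differences(
        {(0, 0): 1.0, (0, 1): 3.0, (1, 0): 2.5, (1, 1): 4.0}))
```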

FIG. 10 shows an alternate implementation of the ACS operation and generation of the path metric differences Δ⁻¹ and Δ₀. For each state, the ACSU 1000 performs the ACS operation to determine the winning path using a set of adders 1010, a set of comparators 1020, selection logic and a selector 1030. The path metric for the winning path 820 into state(b₀b⁻¹b⁻²) is determined with six parallel concurrent two-way comparisons 1020. For a more detailed discussion of the implementation of the ACS operation for multiple-step trellises using parallel concurrent comparisons, see U.S. patent application Ser. No. 10/853,087, entitled “Method and Apparatus for Multiple-Step Viterbi Detection with Local Feedback,” filed on May 25, 2004 and incorporated by reference herein.

In the ACSU 1000, the path metric differences Δ⁻¹ and Δ₀ are selected or computed after the two-step-trellis ACS operation, as shown in FIG. 10. The two-bit, two-step-trellis ACS decision ef generated by the selection logic 1030 is again used to select the path metric for the winning path (also referred to as the win-win path 820) by a selector 1035 as in a conventional two-step-trellis ACSU. The path metric difference Δ⁻¹ is selected by a selector 1045 (controlled by selection logic 1040 that processes the 2-bit ACS decision ef) that selects the output of the appropriate comparator 1020 that produced the absolute value of the difference between the path metric of the win-win path 820 and lose-win path 830.

Similarly, the path metric difference Δ₀ is selected by a selector 1055 (controlled by selection logic 1050 that processes the first bit, e, of the 2-bit ACS decision ef and the selection signal F) that selects the output of the appropriate comparator 1020 that produced the absolute value of the difference between the path metric of the win-win path 820 and win-lose path 810.

The ACS decision F is generated in the ACSU 1000 as follows. The path metric difference between the win-win path 820 and win-lose path 810 and the path metric difference between the win-win path 820 and the lose-lose path 840 are chosen using two selectors 1060, 1065, each of which is controlled by selection logic that processes the 2-bit ACS decision ef. The two selected path metric differences are compared by a comparator 1070 to generate the corresponding ACS decision F.

SMU

FIG. 11 is a schematic block diagram showing an exemplary implementation of the survivor memory unit 1100 of FIG. 7. Generally, the SMU 1100 stores and updates the state bits for all 8 survivor paths using a conventional register-exchange architecture, where the multiplexers 1110 are controlled by the two-bit, two-step-trellis ACS decision ef. FIG. 11 shows the double row of the survivor memory unit 1100 that stores the odd and even state bits {circumflex over (b)}₀, {circumflex over (b)}⁻¹, {circumflex over (b)}⁻², {circumflex over (b)}⁻³, {circumflex over (b)}⁻⁴, {circumflex over (b)}⁻⁵, . . . along the survivor path into state(b₀b⁻¹b⁻²)=state(000). The top row in the exemplary embodiment processes the predefined state bit b₀ and corresponding predefined state bits from other states, under control of the ACS decision ef, whereas the bottom row processes the predefined state bit b⁻¹ and corresponding predefined state bits from other states, under control of the ACS decision ef. A double row structure similar to the one of FIG. 11 is implemented for all 8 states. Per state and stored survivor bit pair, the SMU 1100 implements two multiplexers 1110 and two registers 1120 as a constituent functional unit. The SMU 1100 produces at the output the final survivor bits {circumflex over (b)}_(−D+2) and {circumflex over (b)}_(−D+1), where D is the path memory depth. In the exemplary embodiment 1100, D=8. For a discussion of the register-exchange SMU architecture, see, e.g., R. Cypher and C. B. Shung, “Generalized Trace-Back Techniques for Survivor Memory Management in the Viterbi Algorithm,” Journal of VLSI Signal Processing, 85-94 (1993).
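A register-exchange update of this kind can be sketched in Python as follows. This is a behavioral illustration under stated assumptions, not the circuit of FIG. 11: the names `predecessor` and `smu_update` are hypothetical, and the sketch assumes that the two-bit decision ef directly supplies the two oldest bits of the predecessor state, an encoding the text does not specify.

```python
from itertools import product

D = 8  # path memory depth of the exemplary embodiment

def predecessor(state, decision):
    """Return the two-step predecessor of state = (b0, b_-1, b_-2).

    Assumption: decision = (e, f) gives the bits (b_-3, b_-4) of the
    predecessor state (b_-2, b_-3, b_-4).
    """
    _, _, b_minus2 = state
    e, f = decision
    return (b_minus2, e, f)

def smu_update(survivors, decisions):
    """One two-step-trellis cycle of a register-exchange survivor memory.

    survivors maps each state to its D survivor bits, newest first.
    decisions maps each state to its two-bit ACS decision (e, f).
    """
    new_survivors = {}
    for state in product((0, 1), repeat=3):
        pred = predecessor(state, decisions[state])
        # Two new state bits enter; the selected survivor row shifts by two.
        new_survivors[state] = [state[0], state[1]] + survivors[pred][:D - 2]
    return new_survivors
```

Each call models what the pair of multiplexers and registers per stored bit pair does in one cycle: every state copies the survivor row of the predecessor chosen by its own decision and prepends its two newest state bits.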

The ML path 820 is the path with the overall minimum path metric. The survivor bits {circumflex over (b)}_(−D+2) and {circumflex over (b)}_(−D+1) that correspond to the state with the overall minimum path metric are provided to the delay buffer D1 (FIG. 7) and denoted as b′_(−D+2) and b′_(−D+1). These bits are the state bits for the ML path 820, and they determine both the starting state for the reliability update operation and the final bit decisions.

Delay Buffers D1, D2, and D3

As previously indicated, the two-step-trellis SOVA architecture 700 of FIG. 7 comprises a number of delay buffers D1-D3. The delay buffer D1 delays the state bits at the end of the SMU 1100 that belong to the ML path 820 by two two-step-trellis clock cycles. The final three bits of this buffer D1 define the starting state for the second step of the two-step SOVA. The starting state signal is used to select the path metric differences and equivalence bits for the ML path.

The ACS decisions e, f, F and the path metric differences Δ⁻¹, Δ₀ for all states are also delayed in the delay buffers D2. The delay of D2 is equal to the sum of the delay of the path memory and the buffer D1. The delay buffer D3 further delays the state bits that are output by the buffer D1. The delay of D3 is equal to the delay of the reliability update unit.

Path Comparison Unit

As previously indicated, the path comparison unit 1200, shown in FIG. 12 and FIG. 13, computes for each state the paths that compete with the survivor path, i.e., win-win path 820. In addition, the path comparison unit 1200 generates, for each state and bit within the reliability update window, an equivalence bit that indicates whether the win-win path 820 and a competing path agree in terms of the bit decision. In FIGS. 12 and 13, only the rows for state(b₀b⁻¹b⁻²)=state(000) are shown.

FIG. 12 is a schematic block diagram showing an exemplary implementation of the path comparison unit 1200-even for bits corresponding to even one-step-trellis periods and FIG. 13 is a schematic block diagram showing an exemplary implementation of the path comparison unit 1200-odd for bits corresponding to odd one-step-trellis periods (collectively referred to as the path comparison unit 1200). The path comparison unit 1200 receives at each two-step-trellis cycle for each state the delayed ACS decisions e, f and F, from which the selection signals ef, e f and ēF are derived. The path comparison unit 1200 stores and updates the bits that correspond to all survivor paths. The path comparison unit 1200 also computes equivalence bits for each surviving bit: an equivalence bit is 1 if the bit for the survivor path 820 and competing path disagree, and 0 otherwise.

The survivor bits {circumflex over (b)}₀, {circumflex over (b)}⁻¹, {circumflex over (b)}⁻², {circumflex over (b)}⁻³, {circumflex over (b)}⁻⁴, {circumflex over (b)}⁻⁵, . . . are generated as shown in FIG. 12 for even one-step-trellis periods and in FIG. 13 for odd one-step-trellis periods of a two-step-trellis cycle.

In FIGS. 12 and 13, the survivor bits of the win-lose path 810, which are selected by ēF, and the survivor bits of the lose-win path 830, which are selected by e f, are compared to the survivor bits of the win-win path 820, which are selected by ef, to generate corresponding equivalence bits. The path comparison unit 1200 resembles the register-exchange implementation of the survivor memory unit 1100. The bottom row of each of the path comparison units 1200-even, 1200-odd contains registers 1220 and multiplexers 1210 that store and select the survivor paths for every state.

In addition, the top and middle rows of the path comparison units 1200-even, 1200-odd contain, per one-step-trellis period and state, two multiplexers 1210 that select the bits of the competing lose-win path 830 and win-lose path 810 using the selection signals e f and ēF, respectively. Two XOR gates generate the respective equivalence bits, each indicating whether the bit for the respective path (the lose-win path 830 or win-lose path 810 associated with the selection signals e f and ēF) and the bit for the winning (win-win) path are equivalent. The notation q_(−2,0) indicates the equivalence bit for survivor bit {circumflex over (b)}⁻² and path metric difference Δ′₀, while q_(−2,−1) indicates the equivalence bit for survivor bit {circumflex over (b)}⁻² and path metric difference Δ′⁻¹. Each column of the path comparison unit 1200-even corresponds to an even one-step-trellis period, and each column of the path comparison unit 1200-odd corresponds to an odd one-step-trellis period.

A structure similar to the one shown in FIGS. 12 and 13 is required for each state. While FIGS. 12 and 13 show a number of columns each containing three multiplexers, two XOR gates and one register, the first column in FIG. 12 only includes two multiplexers, one XOR gate and one register, as it computes only one equivalence bit, i.e. q_(0,0), in the exemplary embodiment. The path comparison unit generates, for each state, equivalence bits up to q_(−U+2,−1), q_(−U+2,0) and q_(−U+1,−1), q_(−U+1,0), respectively, where U is the reliability update length. In the exemplary embodiment 1200, U=6.
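One column of this structure for a single state, with three multiplexers, two XOR gates and one register, can be modeled with the following Python sketch. The function name and the representation of the selection signals as integer indices into a list of candidate bits are assumptions made for illustration; the actual signal encoding is defined by the figures.

```python
def comparison_element(candidate_bits, sel_win_win, sel_lose_win, sel_win_lose):
    """Behavioral sketch of one column of the path comparison unit.

    candidate_bits: survivor bits offered by the previous column, one per
        predecessor state (all three multiplexers see the same inputs).
    sel_*: index chosen by the selection signal of the win-win, lose-win
        and win-lose path (integer indices are an assumption).
    Returns the bit to be stored in the register and the two equivalence bits.
    """
    survivor_bit = candidate_bits[sel_win_win]      # multiplexer feeding the register
    lose_win_bit = candidate_bits[sel_lose_win]     # multiplexer for the lose-win path
    win_lose_bit = candidate_bits[sel_win_lose]     # multiplexer for the win-lose path
    q_minus1 = survivor_bit ^ lose_win_bit          # XOR: 1 if the bits disagree
    q_0 = survivor_bit ^ win_lose_bit               # XOR: 1 if the bits disagree
    return survivor_bit, q_minus1, q_0
```

With candidate bits [0, 1, 1, 0] and selections 1, 0 and 2 for the win-win, lose-win and win-lose paths, the sketch stores survivor bit 1 and yields equivalence bits 1 and 0, since only the lose-win path disagrees with the survivor.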

Reliability Update Unit

FIG. 14 is a schematic block diagram showing an exemplary implementation of the reliability update unit 1400 of FIG. 7 that updates the reliabilities for the maximum-likelihood path 820. The exemplary reliability update unit 1400 computes and stores two reliability values per two-step-trellis cycle.

Δ′⁻¹ and Δ′₀ are the delayed path metric differences for the ML path 820 into the starting state (see FIG. 7). These two values are selected among the buffered path metric differences using the starting state signal as shown in FIG. 7.

q′_(0,0), q′_(−1,−1), q′_(−1,0), q′_(−2,−1), q′_(−2,0), q′_(−3,−1), q′_(−3,0), . . . are the equivalence bits for the ML path into the starting states state(b′⁻¹b′⁻²b′⁻³) and state(b′₀b′⁻¹b′⁻²). These signals are selected among the equivalence bits computed in the path comparison unit (see FIGS. 12 and 13) using the starting state signal as shown in FIG. 7.

The reliabilities R′_(0,0), R′_(−1,0), R′_(−2,0), R′_(−3,0), R′_(−4,0), R′_(−5,0), . . . are updated based on Δ′₀, whereas R′_(−1,−1), R′_(−2,−1), R′_(−3,−1), R′_(−4,−1), R′_(−5,−1) . . . are updated based on Δ′⁻¹.

R_(max) is a hard-wired value and denotes the maximum reliability value, e.g., R_(max)=∞. The first reliabilities R′_(0,0) and R′_(−1,−1) consider R_(max) as an initialization value in the exemplary embodiment.

After initialization, a functional element, such as the exemplary functional element 1410, comprises four functional units, such as the exemplary functional unit 1420, and two registers. Each functional unit 1420 comprises a comparator, a multiplexer and an AND gate. The top row of the reliability update unit 1400 computes reliability values for even one-step-trellis periods and the bottom row computes reliability values for odd one-step-trellis periods. For example, R′_(0,0) (computed in the previous two-step-trellis cycle) and Δ′⁻¹ are used to compute R′_(−2,−1), under control of the corresponding equivalence bit q′_(−2,−1). Thereafter R′_(−2,−1) and Δ′₀ are used to compute R′_(−2,0) under control of the corresponding equivalence bit q′_(−2,0). Thus, two functional units operate in series to first compute R′_(−2,−1) and then R′_(−2,0). In an analogous fashion, two functional units operate in series to first compute R′_(−3,−1) and then R′_(−3,0), by using the path metric differences, Δ′⁻¹ and Δ′₀, and corresponding equivalence bits. In summary, two groups of functional units operate in parallel to compute the reliability values R′_(−2,0) and R′_(−3,0) for the same two-step-trellis cycle, where each group comprises two functional units that operate in series.
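The series connection of two functional units can be illustrated with a short Python sketch. The wiring inside the functional unit is an assumption: the text only lists a comparator, a multiplexer and an AND gate, and the sketch combines them in the usual SOVA manner, where a reliability is lowered to the path metric difference only when the difference is smaller and the equivalence bit is 1. The function names are hypothetical.

```python
import math

R_MAX = math.inf  # maximum reliability, used as the initialization value

def functional_unit(r_in, delta, q):
    """Sketch of one functional unit (comparator, AND gate, multiplexer)."""
    delta_is_smaller = delta < r_in             # comparator
    take_delta = delta_is_smaller and q == 1    # AND gate with the equivalence bit
    return delta if take_delta else r_in        # multiplexer

def update_pair(r_prev, delta_minus1, delta_0, q_minus1, q_0):
    """Two functional units in series, e.g. R'_(0,0) -> R'_(-2,-1) -> R'_(-2,0)."""
    r_mid = functional_unit(r_prev, delta_minus1, q_minus1)
    return functional_unit(r_mid, delta_0, q_0)
```

Starting from R_MAX with differences 3.0 and 1.5 and equivalence bits 1 and 0, the first unit lowers the reliability to 3.0 and the second leaves it unchanged, because its equivalence bit is 0. Two such pairs evaluated independently model the two groups that operate in parallel within one two-step-trellis cycle.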

The reliability update unit 1400 computes the final reliabilities R′_(−U+2)=R′_(−U+2,0) and R′_(−U+1)=R′_(−U+1,0), where U is the reliability update length. Soft decisions S′_(i) are generated based on the final reliability values and corresponding bit decisions, e.g. according to the rule:

$S_{i}^{\prime} = \begin{cases} R_{i}^{\prime} & \text{if } b_{i}^{\prime} = 0 \\ -R_{i}^{\prime} & \text{if } b_{i}^{\prime} = 1 \end{cases}$
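As a small illustration of this rule, the following Python sketch maps a final reliability and its bit decision to a signed soft decision. The function name is hypothetical.

```python
def soft_decision(reliability, bit):
    """Sign the final reliability with the bit decision (0 -> +R, 1 -> -R)."""
    return reliability if bit == 0 else -reliability
```

A reliability of 2.5 thus becomes 2.5 for a decided 0 and -2.5 for a decided 1.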

It is to be understood that the embodiments and variations shown and described herein are merely illustrative of the principles of this invention and that various modifications may be implemented by those skilled in the art without departing from the scope and spirit of the invention. 

1. A path comparison unit that determines a plurality of paths in a trellis that compete with a survivor path in said trellis, comprising: a first type functional unit comprising a multiplexer and a register to store one or more survivor bits associated with said survivor path; and at least two second type functional units, wherein each second type functional unit comprises a multiplexer and a logical circuit to compute at least one equivalence bit indicating whether the bit for a respective path and the bit for said survivor path are equivalent.
 2. The path comparison unit of claim 1, wherein the respective path is one or more of a win-lose path and a lose-win path.
 3. The path comparison unit of claim 1, wherein a plurality of said first type functional units are connected in a register exchange architecture.
 4. The path comparison unit of claim 1, wherein said first type functional unit and said two second type functional units comprise a functional element.
 5. The path comparison unit of claim 4, wherein the first type functional unit and the two second type functional units within a functional element operate in parallel.
 6. The path comparison unit of claim 4, wherein said logical circuit compares an output of two multiplexers in the same functional element.
 7. The path comparison unit of claim 4, wherein a plurality of said functional elements are configured in an array structure having a plurality of rows and a plurality of columns.
 8. The path comparison unit of claim 7, wherein an output of one of said first type functional units in a first column is applied as an input to at least one functional element in a subsequent column.
 9. The path comparison unit of claim 7, wherein a row in said array structure corresponds to a given state.
 10. The path comparison unit of claim 7, wherein a column in said array structure corresponds to a one-step-trellis period.
 11. The path comparison unit of claim 7, wherein inputs to a given one of said multiplexers comprise at least one survivor bit from at least one functional element of a previous column.
 12. The path comparison unit of claim 7, wherein each multiplexer in said first type functional units in a given row of said array structure is controlled by the same selection signal.
 13. The path comparison unit of claim 7, wherein said at least two second type functional units in a given row of said array structure comprise first and second multiplexers operating in parallel and wherein said first multiplexers in said given row are controlled by a first selection signal and said second multiplexers in said given row are controlled by a second selection signal.
 14. The path comparison unit of claim 7, wherein a plurality of said functional elements in said array structure are connected according to said trellis.
 15. The path comparison unit of claim 1, wherein said at least one equivalence bit is computed for at least one state and bit within a reliability update window.
 16. The path comparison unit of claim 1, further comprising a first path comparison unit for bits corresponding to even one-step-trellis periods and a second path comparison unit for bits corresponding to odd one-step-trellis periods.
 17. The path comparison unit of claim 1, wherein said multiplexers are controlled by at least one selection signal based on an add-compare-selection decision.
 18. The path comparison unit of claim 1, wherein survivor bits of a win-lose path and a lose-win path are compared to survivor bits of said survivor path to generate corresponding equivalence bits.
 19. The path comparison unit of claim 1, wherein said equivalence bits are provided to a reliability unit that determines a reliability value for at least one bit decision.
 20. The path comparison unit of claim 1, wherein said logical circuit performs an exclusive OR (XOR) function.
 21. A circuit, comprising: a first type functional unit comprising a multiplexer and a register; and at least two second type functional units, wherein each second type functional unit comprises a multiplexer and a logical circuit, wherein (i) said first type functional unit and said at least two second type functional units operate in parallel, (ii) said multiplexers in said first type functional unit and said at least two second type functional units each process a same set of input signals, (iii) a first logical circuit in a first of said at least two second type functional units processes an output from a first multiplexer in said first of said at least two second type functional units and an output from said multiplexer in said first type functional unit; and (iv) a second logical circuit in a second of said at least two second type functional units processes an output from a second multiplexer in said second of said at least two second type functional units and an output from said multiplexer in said first type functional unit.
 22. The circuit of claim 21, wherein a plurality of said first type functional units are connected in a register exchange architecture.
 23. The circuit of claim 21, wherein said logical circuit performs an exclusive OR (XOR) function.
 24. The circuit of claim 21, wherein a plurality of said circuits are configured in an array structure having a plurality of rows and a plurality of columns.
 25. The circuit of claim 24, wherein said first type functional unit and said two second type functional units comprise a functional element and wherein an output of said first type functional unit is applied to at least one functional element of a subsequent column.
 26. The circuit of claim 24, wherein inputs to a given one of said multiplexers comprise at least one value from at least one circuit of a previous column.
 27. The circuit of claim 24, wherein each multiplexer in said first type functional units in a given row of said array structure is controlled by the same selection signal.
 28. The circuit of claim 24, wherein said first multiplexers of said second type functional units in said given row are controlled by a first selection signal and said second multiplexers of said second type functional units in said given row are controlled by a second selection signal.
 29. The circuit of claim 24, wherein a plurality of said circuits in said array structure are connected according to a trellis. 